Conversation
|
It's happening! |
Owner
Author
|
Absolutely! 🚀 |
|
Any updates? Is it usable? |
Owner
Author
|
Hey @yukiarimo It's almost done. I just want to run some extra tests (QLoRA, full finetune) and finish Qwen2-VL to release it. |
1. [IMG_BREAK] and [IMG_END] are lost after embedding
2. image position encode should be done per image base
https://github.com/mistralai/mistral-inference/blob/main/src/mistral_inference/vision_encoder.py#L85
https://github.com/huggingface/transformers/blob/main/src/transformers/models/pixtral/modeling_pixtral.py#L492
Co-authored-by: Roger Xu <rogerxu@gmail.com>
This was referenced Oct 11, 2024
Garry-TI
pushed a commit
to Garry-TI/mlx-vlm
that referenced
this pull request
Sep 23, 2025
* remove torch and mlx-lm * remove torch and mlx-lm * add peft model creation * use tree flatten * add dataset loader * fix dataset * fix masks and rename dataset * support batch processing and train on completions * fix trainer * formatting * add support for none splits and fix assistant id * Add lora script and docs * remove torch and mlx-lm * add peft model creation * use tree flatten * add dataset loader * fix dataset * fix masks and rename dataset * support batch processing and train on completions * fix trainer * formatting * add support for none splits and fix assistant id * Add lora script and docs * remove duplicates * fix batch load * load trained adapters and add super to all models * fix pixtral quant * speed up qwen batch processing * fix qlora training * fix dataloader * formatting * fix pixtral pixel loading * fix lora and dataset * add batch processing suppor for qwen2_vl * update lora docs * add unit tests * set stage for phi3_v support * update logs and readme * add utils tests and remove unused collate fn * refactor prompt utils and add multi-image support for pixtral * add llava interleave support * multi image support * add image resizing * refactor data loading * update data procesing and tqdm * add llava interleave * formatting * add list of models with multi-image support * remove trimmed labels * remove warning * pin reqs * add config dict condition * fix pixtral FT prompt * formatting images * remove unused * update trainer init * update lora * update md and formatting * bump version * add tests for pixtral and qwen2_vl * add tests for pixtral * Merge branch 'pc/tuner' of https://github.com/Blaizzy/mlx-vlm into pc/tuner * fix test * remove rope scaling * remove test args and update MD * format dataset defaults * add dataset formatting info * Fix issues with multiple image handling (Blaizzy#78) 1. [IMG_BREAK] and [IMG_END] are lost after embedding 2. image position encode should be done per image base https://github.com/mistralai/mistral-inference/blob/main/src/mistral_inference/vision_encoder.py#L85 https://github.com/huggingface/transformers/blob/main/src/transformers/models/pixtral/modeling_pixtral.py#L492 Co-authored-by: Roger Xu <rogerxu@gmail.com> * fix styling * update model * update default model * rewrite comments --------- Co-authored-by: hiima234 <98786318+hiima234@users.noreply.github.com> Co-authored-by: Roger Xu <rogerxu@gmail.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This PR adds:
New Models
Closes #73 #69